Back

The American Journal of Human Genetics

77 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Too rare to be random: genetic finding suggests previously unrecognized path of mutagenesis
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26346966
Top 0.1% (21.7%)
Show abstract

We report a previously undescribed genotypic configuration identified in twins with HNRNPU-related neurodevelopmental disorder. Both twins have two closely spaced mosaic variants on the same allele that never co-occur on any single DNA molecule, resulting in three distinct cell lineages within each individual. We define this genotypic configuration as clustered monoallelic mosaicism (cMoMa). Recognizing the extreme improbability of such a configuration, we systematically explore two potential me...

2
Language models reveal evidence gaps in variants of uncertain significance
2026-03-02 genetic and genomic medicine 10.64898/2026.02.28.26347206
Top 0.2% (18.4%)
Show abstract

BackgroundMost rare coding variants in monogenic disease genes remain classified as Variants of Uncertain Significance (VUS), limiting their use in clinical care. Many variant classifications have been submitted to ClinVar, often with rich free-text summaries of the evidence underlying each classification. These narratives are not standardized and are difficult to mine systematically, making it challenging to identify variants that might be reclassified as new evidence becomes available. Method...

3
Clinical, in vitro, and in vivo evidence of WAPL as a novel cohesinopathy gene and phenotypic driver of 10q22.3q23.2 genomic disorder
2026-02-28 genetic and genomic medicine 10.64898/2026.02.23.26346364
Top 0.2% (18.4%)
Show abstract

Cohesin is a fundamental genome-organizing complex that orchestrates three-dimensional chromosome folding and gene expression via DNA loop extrusion. Alterations to genes encoding cohesin subunits and cohesin loaders cause Mendelian disorders, including Cornelia de Lange syndrome (CdLS). By contrast, disruption of factors that remove cohesin from DNA, including WAPL and its binding partners PDS5A and PDS5B, have not yet been associated with human disease. Here, we explored the relevance of these...

4
A time-to-event heritability framework for inferring the genetic architecture of longitudinal traits
2026-02-22 genetic and genomic medicine 10.64898/2026.02.16.26346285
Top 0.2% (18.2%)
Show abstract

Biobanks with longitudinal measurements have advanced our understanding of time-to-event (TTE) traits including age-of-onset and disease progression. However, limited work has characterized the heritability of TTE traits, a key parameter for comparisons of total association and predictive power. Here, we present COXMM, a Cox proportional hazard mixed model for estimating TTE heritability. Simulations show our model achieves nearly unbiased results, whereas non-TTE approaches severely underestima...

5
RNA sequencing resolves cryptic pathogenic variants in mitochondrial disease
2026-02-23 genetic and genomic medicine 10.64898/2026.02.23.26345976
Top 0.3% (17.8%)
Show abstract

BackgroundMitochondrial diseases are the most common inherited metabolic disorders, characterized by pronounced clinical and genetic heterogeneity that complicates molecular diagnosis. Although DNA-based sequencing approaches have become standard in genetic testing, up to half of patients remain without a definitive diagnosis. RNA sequencing (RNA-seq) provides a complementary layer of evidence by revealing functional consequences of genetic variation, thereby improving diagnostic yield. Methods...

6
Novel PCDH12 pathogenic missense variants cause neurodevelopmental disorders with ocular malformation
2026-03-06 neurology 10.64898/2026.03.05.26343794
Top 0.4% (17.3%)
Show abstract

Protocadherin-12 (PCDH12), a cell-adhesion protein belonging to the non-clustered protocadherin family, plays a crucial role in the establishment and regulation of neuronal connections and communication. Bi-allelic loss-of-function (LoF) variants in the PCDH12 gene have been associated with several neurodevelopmental disorders (NDDs) such as diencephalic-mesencephalic junction dysplasia (DMJD) syndrome, cerebral palsy, and cerebellar ataxia, often accompanied by ocular abnormalities. However, ge...

7
PHARMWATCH: A Multilayer Pharmacogenomics Safety System for Accurate Star Allele Interpretation
2026-02-28 genetic and genomic medicine 10.64898/2026.02.26.26347200
Top 0.4% (17.2%)
Show abstract

The Clinical Pharmacogenetics Implementation Consortium (CPIC) bases its drug-gene recommendations on the assignment of star alleles, which map known genotypes to defined functional categories and corresponding drug dosage guidelines. The star allele framework, first proposed in 1996 for the CYP gene family and later formalized with CPICs establishment in 2010 [1, 2], remains foundational to pharmacogenomics. However, this system has notable limitations. Its dependence on a restricted set of ben...

8
Standardized transcriptome analysis improves rare disease diagnosis in the pan-European Solve-RD consortium
2026-02-14 genetic and genomic medicine 10.64898/2026.02.10.26345954
Top 0.5% (15.4%)
Show abstract

RNA sequencing (RNA-seq) provides a powerful complement to DNA sequencing for uncovering pathogenic defects affecting gene expression and splicing in individuals with genetically undiagnosed rare disorders. However, as large rare disease consortia adopt RNA-seq, challenges arise due to cohort heterogeneity, variability in tissues and sample sizes, and differences in interpretation practices. Here, we present a harmonized analytical and interpretation framework developed by the pan-European Solv...

9
Gene Portals: A Framework for Integrating Clinical, Functional, and Structural Evidence into Rare Disease Variant Classification
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347086
Top 0.5% (15.3%)
Show abstract

Rare Mendelian disorders affect 300-400 million people globally. Although genetic testing has become widely adopted, gene-specific evidence for tailored variant interpretation remains scattered across resources. We present Gene Portals, a framework for gene-centered multimodal knowledge bases that co-localize expert-harmonized clinical data, functional assays, population variation, structural annotations and gene-specific ACMG/AMP specifications within a single resource. A modular interface inte...

10
Phenotypic and transcriptomic characterisation of a novel biallelic RNU2-2 developmental and epileptic encephalopathy
2026-02-23 genetic and genomic medicine 10.64898/2026.02.19.26345867
Top 0.6% (14.2%)
Show abstract

A significant proportion of individuals with suspected genetic developmental and epileptic encephalopathies (DEEs) remain unsolved following whole genome sequencing (WGS). We screened individuals who received WGS analyses at Genomic Medicine Centre Karolinska for Rare Diseases for biallelic RNU2-2 variants. Deep phenotyping was performed and phenotypic traits were transcribed to their corresponding Human Phenotype Ontology (HPO) term. HPO terms were used to generate pairwise phenotypic similari...

11
Learning lifetime disease liability reveals and removes genetic confounding in electronic health records
2026-02-22 genetic and genomic medicine 10.64898/2026.02.15.26346336
Top 0.7% (14.2%)
Show abstract

Electronic health records (EHRs) have become the cornerstone of population-scale genetic studies1, but factors including patterns of healthcare use shape which and how diagnoses are recorded, leading to confounding effects in genetic associations with EHR codes2. In this study we propose EDGAR, a deep learning framework that recovers lifetime disease liability from EHR by aligning diagnostic codes with clinically validated measures and disease labels in a set of individuals prioritized through a...

12
Massively parallel functional profiling identifies CCDC88C as a risk gene for ER-positive breast cancer
2026-03-03 genetic and genomic medicine 10.64898/2026.03.02.26347419
Top 0.7% (14.0%)
Show abstract

Genome wide association studies (GWAS), combined with fine-mapping have identified 196 independent signals associated with breast cancer risk. Deciphering the functional basis of these associations can inform our understanding of the biology and aetiology of breast cancer. Decoding GWAS risk associations is challenging due to linkage disequilibrium between variants and because most variants map to non-coding regions, influencing breast cancer risk via cis-regulatory mechanisms that modulate the ...

13
Genome-wide association studies to identify shared and distinct mechanisms of fibrosis across 12 organ-systems
2026-02-19 genetic and genomic medicine 10.64898/2026.02.18.26346458
Top 0.8% (13.6%)
Show abstract

IntroductionFibrosis can affect organs throughout the body and is present in a wide range of diseases. Recent research has suggested that there could be shared biological mechanisms that lead to fibrosis in different organs. MethodsWe performed genome-wide association studies using UK Biobank for fibrosis in 12 different organ-systems and meta-analysed results with previously published studies of fibrotic diseases. We considered genetic associations that colocalised across [≥]3 organs as tho...

14
De novo variants in LDB1 are linked to distinct neurodevelopmental phenotypes determined by variant location and differing pathomechanisms
2026-02-28 genetic and genomic medicine 10.64898/2026.02.26.26347174
Top 0.8% (12.5%)
Show abstract

LDB1 encodes transcriptional regulator protein LIM-domain-binding protein 1, which plays an important role in neurogenesis. Few C-terminal likely gene disrupting (LGD) variants have been reported in the literature in individuals with congenital ventriculomegaly. Through international collaboration, we now assembled a cohort of 16 individuals with de novo variants affecting various regions of LDB1. Eleven variants affect either the whole gene or the N-terminal dimerization domain (including gene ...

15
Building The Human Genotype-Phenotype Map to Harness Pleiotropy and Refine Disease Mechanisms
2026-02-20 genetic and genomic medicine 10.64898/2026.02.19.26346618
Top 0.8% (12.5%)
Show abstract

Mapping the pleiotropic effect of genetic variation on biological processes and complex phenotypes is fundamental to extracting translational insight from genome-wide association studies (GWAS). Here we present The Human Genotype-Phenotype Map (GPMap), a repository of colocalizing genetic associations across 15,997 complex traits and 2.7 million molecular measurements, leveraging common and rare variants and cis-and trans-acting effects across disaggregated tissue types and single cell datasets ...

16
Leveraging genome-wide effects on gene expression to identify disease-critical genes with trans-genetic components
2026-02-25 genetic and genomic medicine 10.64898/2026.02.23.26346922
Top 0.8% (12.5%)
Show abstract

Genome-wide association studies (GWAS) have implicated tens of thousands of genetic variants associated with complex traits and polygenic diseases. Colocalizing GWAS variants with variants that may regulate gene expression, via expression quantitative trait loci (eQTL) mapping, has successfully led to the identification of disease-critical genes and their cell types of action. Recent studies predominantly colocalize proximal cis-eQTLs, which are estimated to regulate [~]10% of variance in gene e...

17
Variant curation of the largest compendium of FOXL2 coding and non-coding sequence and structural variants in BPES
2026-03-02 genetic and genomic medicine 10.64898/2026.02.24.25339471
Top 0.8% (12.3%)
Show abstract

Heterozygous FOXL2 (non-)coding sequence and structural variants (SVs) lead to blepharophimosis, ptosis and epicanthus inversus syndrome (BPES), a rare, autosomal dominant developmental disorder characterized by a completely penetrant eyelid malformation and incompletely penetrant primary ovarian insufficiency (POI). We collected variants from our in-house database, generated via clinical genetic testing and downstream research testing in the Center for Medical Genetics Ghent, Belgium (2001-202...

18
Breaking the 7 Mb barrier: Clinical cohort validation of genome-wide NIPT with fetal fraction enrichment and BinDel for detection of 1 Mb microdeletions and -duplications
2026-02-11 genetic and genomic medicine 10.64898/2026.02.10.26345955
Top 0.8% (12.3%)
Show abstract

ObjectiveTo evaluate the analytical and clinical performance of fetal fraction (FF) enriched genome-wide noninvasive prenatal testing (GW-NIPT) for detection of clinically relevant copy number variants (CNVs) down to 1 Mb. MethodsWe retrospectively analyzed 10,501 singleton pregnancies tested with FF enrichment-based GW-NIPT between August 2023 and July 2025. CNV analysis was performed using BinDel and WisecondorX. ResultsFF enrichment increased median FF to 24% (2.4-fold increase). Clinically...

19
Long-read nanopore sequencing uncovers population-specific structural variation in the Middle East and North Africa
2026-02-23 genetic and genomic medicine 10.64898/2026.02.20.26346743
Top 0.9% (12.2%)
Show abstract

Structural variants (SVs) are a major source of genomic diversity and disease susceptibility; however, populations from the Middle East and North Africa (MENA) region remain critically underrepresented in global reference databases. We provide the first detailed catalogue of structural variation in 61 individuals from diverse MENA countries, using publicly available ultra-long Oxford Nanopore sequencing. A scalable and dual-reference alignment-based method (GRCh38 and T2T-CHM13) was employed to ...

20
Investigating penetrance of severe combined immunodeficiency variants in an adult population cohort: implications for genomic newborn screening
2026-02-18 genetic and genomic medicine 10.64898/2026.02.17.26346478
Top 0.9% (11.9%)
Show abstract

Severe combined immunodeficiency (SCID) is a heterogeneous, recessive disorder, associated with the onset of severe, recurrent infections in the first few months of life. SCID is fatal if left untreated, but outcomes can be significantly improved by prompt diagnosis and treatment, particularly prior to onset of infection. Consequently, SCID is already included in many newborn screening programmes around the world, as well as multiple international genomic newborn screening (gNBS) research progra...